The "XML/Repetti" Project: Encoding and Manipulation of Temporal Information in Historical Text Sources

Authors

  • Fabio Grandi
  • Federica Mandreoli
Abstract

The paper deals with the deployment of XML-related technologies in Cultural Heritage applications, specifically the encoding of temporal semantics in the digital version of historical documents. Since written sources often have the same importance as material evidence in medieval archaeology, our approach can be applied to the development of tools that support archaeological research. In previous work, we developed an XML/XSL infrastructure called “The Valid Web” for the definition and management of historical information within Web documents. In this paper we describe the application and extension of that approach to the realization of the electronic version of Repetti's historical-geographical dictionary of Tuscany. The extension concerns the uniform management of temporal indeterminacy and the use of multiple calendars and granularities; the proposed solutions have been inspired by similar research on temporal query languages. From the user's viewpoint, the proposed XML extensions allow the addition of historical metainformation to the encoded text sources and their “intelligent” temporal navigation via standard Web browsers. The project also involves the definition of optimized search algorithms, the storage and temporal indexing of the XML-encoded items of Repetti's Dictionary, and the implementation of a prototype. As a byproduct, a tool for computer-aided temporal XML encoding of text sources will also be developed for use by Cultural Heritage operators (e.g. archaeology researchers).

Similar resources

The "XML/Repetti" Project: Encoding and Manipulation of Temporal Information in Historical Text Sources

ABSTRACT The paper deals with the deployment of XML-related technologies in Cultural Heritage applications concerning the encoding of temporal semantics in the digital version of historical documents. Since written sources have often the same importance as material evidence in medieval archaeology, our approach can be applied to the development of tools for the support of archaeological researc...

Metaheuristic Clustering of Persian XML Documents Based on Structural and Content Similarity

Due to the increasing number of XML documents, organizing them effectively in order to retrieve useful information is essential. A possible solution is to cluster XML documents in order to discover knowledge. A key issue in clustering XML documents is how to measure the similarity between them. Conventional clustering of text documents using a do...

Creating an XML Vocabulary for Encoding Lute Music

We describe the development of an XML representation, called TabXML, for encoding historical sources of lute music. These sources employ a special notation type, tablature, that is very hard to understand for non-lutenists. This paper discusses several issues in creating TabXML: 1. what to represent: the notational meaning or the text of the tablature, and how to represent it; 2. an analysis of...

Arabic News Articles Classification Using Vectorized-Cosine Based on Seed Documents

Besides its own merits, text classification (TC) has become a cornerstone in many applications. The work presented here is part of, and a prerequisite for, a project we have undertaken to create a corpus for Arabic text processing. It is an attempt to create modules automatically that would help speed up the process of classification for any text categorization task. It also serves as a tool for...

Exploiting Semantic Web Technologies for Intelligent Access to Historical Documents

The FDR/Pearl Harbor Project involves the enhancement of materials drawn from the Franklin D. Roosevelt Library and Digital Archives, which includes a range of image, sound, video and textual data. The project is undertaking the encoding, annotation, and multi-modal linkage of a portion of the collection, and enhancement of a web-based interface that enables exploitation of state-of-the-art meth...

Publication date: 2001